AITopics | bessel function

Multimodal contrastive learning (MCL) aims to embed data from different modalities in a shared embedding space. However, empirical evidence shows that representations from different modalities occupy completely separate regions of embedding space, a phenomenon referred to as the modality gap. Moreover, experimental findings on how the size of the modality gap influences downstream performance are inconsistent. These observations raise two key questions: (1) What causes the modality gap? (2) How does it affect downstream tasks? To address these questions, this paper introduces the first theoretical framework for analyzing the convergent optimal representations of MCL and the modality alignment when training is optimized. Specifically, we prove that without any constraint or under the cone constraint, the modality gap converges to zero. Under the subspace constraint (i.e., representations of two modalities fall into two distinct hyperplanes due to dimension collapse), the modality gap converges to the smallest angle between the two hyperplanes. This result identifies \emph{dimension collapse} as the fundamental origin of the modality gap. Furthermore, our theorems demonstrate that paired samples cannot be perfectly aligned under the subspace constraint. The modality gap influences downstream performance by affecting the alignment between sample pairs. We prove that, in this case, perfect alignment between two modalities can still be achieved via two ways: hyperplane rotation and shared space projection.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2510.03268

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.45)

Add feedback

618790ae971abb5610b16c826fb72d01-Supplemental.pdf

Neural Information Processing SystemsOct-3-2025, 01:36:27 GMT

artificial intelligence, estimator, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States (0.93)

Industry: Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Add feedback

618790ae971abb5610b16c826fb72d01-Paper.pdf

Neural Information Processing SystemsOct-3-2025, 01:36:21 GMT

artificial intelligence, estimator, machine learning, (13 more...)

Neural Information Processing Systems

Country: North America > United States (1.00)

Industry: Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)

Add feedback

33cc2b872dfe481abef0f61af181dfcf-Supplemental.pdf

Neural Information Processing SystemsOct-2-2025, 15:32:02 GMT

artificial intelligence, machine learning, rotation, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Equivariance by Local Canonicalization: A Matter of Representation

Gerhartz, Gerrit, Lippmann, Peter, Hamprecht, Fred A.

arXiv.org Artificial IntelligenceOct-1-2025

Equivariant neural networks offer strong inductive biases for learning from molecular and geometric data but often rely on specialized, computationally expensive tensor operations. We present a framework to transfers existing tensor field networks into the more efficient local canonicalization paradigm, preserving equivariance while significantly improving the runtime. Within this framework, we systematically compare different equivariant representations in terms of theoretical complexity, empirical runtime, and predictive accuracy. We publish the tensor frames package, a PyTorchGeometric based implementation for local canonicalization, that enables straightforward integration of equivariance into any standard message passing neural network.

artificial intelligence, machine learning, representation, (12 more...)

arXiv.org Artificial Intelligence

2509.26499

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

A Auxiliary Lemmas

Neural Information Processing SystemsAug-19-2025, 13:42:18 GMT

We present some preliminary lemmas in this section. Most of them are basic inequalities in Information Theory, so they are only for auxiliary purposes in our proofs of the main theorems and the corollaries. We refer to Lemma 6.2 of the book Gray ( 2011). We refer to Theorem 14 of the article Liese and Vajda ( 2006). Assume that there are M disjoint / 2 -balls.

artificial intelligence, inequality, posterior distribution, (15 more...)

Neural Information Processing Systems

Country: North America > United States (0.04)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)

Add feedback

Achieving Rotational Invariance with Bessel-Convolutional Neural Networks

Neural Information Processing SystemsAug-18-2025, 19:08:22 GMT

As of today, Convolutional Neural Networks (CNN) are one of the most powerful tools for image analysis. They achieve, thanks to convolutions, an invariance with respect to translations.

artificial intelligence, deep learning, machine learning, (14 more...)

Neural Information Processing Systems

Country:

Europe > Belgium > Wallonia > Namur Province > Namur (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Belgium > Flanders > Antwerp Province > Antwerp (0.04)
Asia > India (0.04)

Industry: Health & Medicine (0.49)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Filters

Collaborating Authors

bessel function

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

f18224a1adfb7b3dbff668c9b655a35a-Paper.pdf

618790ae971abb5610b16c826fb72d01-Paper.pdf

33cc2b872dfe481abef0f61af181dfcf-Supplemental.pdf

Decipher the Modality Gap in Multimodal Contrastive Learning: From Convergent Representations to Pairwise Alignment

618790ae971abb5610b16c826fb72d01-Supplemental.pdf

618790ae971abb5610b16c826fb72d01-Paper.pdf

33cc2b872dfe481abef0f61af181dfcf-Supplemental.pdf

Equivariance by Local Canonicalization: A Matter of Representation

A Auxiliary Lemmas

Achieving Rotational Invariance with Bessel-Convolutional Neural Networks